Generality Is Predictive of Prediction Accuracy

Authors

  • Geoffrey I. Webb
  • Damien Brain
Abstract

During knowledge acquisition, multiple alternative potential rules can all appear equally credible. This paper addresses the dearth of formal analysis of how to select between such alternatives. It presents two hypotheses about the expected impact of selecting between classification rules of differing levels of generality in the absence of other evidence about their likely relative performance on unseen data. It is argued that the accuracy on unseen data of the more general rule will tend to be closer to that of a default rule for the class than will that of the more specific rule. It is also argued that, in comparison to the more general rule, the accuracy of the more specific rule on unseen cases will tend to be closer to the accuracy obtained on training data. Experimental evidence is provided in support of these hypotheses. We argue that these hypotheses can be of use in selecting between rules in order to achieve specific knowledge acquisition objectives.
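
The two quantities the hypotheses compare can be made concrete with a small sketch. It is an illustration only, not the authors' experimental procedure. It assumes that the "default rule for the class" predicts the class for every case, so its accuracy is the class frequency, and that a rule's accuracy is measured over the cases it covers. The names `rule_accuracy`, `default_accuracy`, and `gaps`, the attributes `a` and `b`, and the hand-made cases are all hypothetical; the data are contrived to show the computation and are not evidence for the hypotheses.

```python
from typing import Callable, Dict, List, Optional, Tuple

Case = Tuple[Dict[str, int], str]  # (attribute values, class label)
Cover = Callable[[Dict[str, int]], bool]


def rule_accuracy(covers: Cover, target: str, data: List[Case]) -> Optional[float]:
    """Accuracy of 'IF covers(x) THEN target', measured on the cases it covers."""
    labels = [label for x, label in data if covers(x)]
    if not labels:
        return None  # the rule covers nothing, so accuracy is undefined
    return sum(label == target for label in labels) / len(labels)


def default_accuracy(target: str, data: List[Case]) -> float:
    """Assumed default rule: predict `target` for every case."""
    return sum(label == target for _, label in data) / len(data)


def gaps(covers: Cover, target: str,
         train: List[Case], test: List[Case]) -> Dict[str, float]:
    """Distance of a rule's unseen-data accuracy from the two reference points."""
    train_acc = rule_accuracy(covers, target, train)
    test_acc = rule_accuracy(covers, target, test)
    if train_acc is None or test_acc is None:
        raise ValueError("rule covers no cases in one of the data sets")
    return {
        "to_default": abs(test_acc - default_accuracy(target, test)),
        "to_training": abs(test_acc - train_acc),
    }


def case(a: int, b: int, label: str) -> Case:
    return ({"a": a, "b": b}, label)


# Contrived data: both rules are equally accurate on the training cases,
# so training accuracy alone gives no reason to prefer one over the other.
train = [case(1, 1, "pos"), case(1, 1, "pos"), case(1, 0, "pos"),
         case(0, 1, "neg"), case(0, 0, "neg"), case(0, 0, "neg")]
test = [case(1, 1, "pos"), case(1, 1, "pos"), case(1, 0, "neg"), case(1, 0, "pos"),
        case(0, 1, "neg"), case(0, 0, "neg"), case(0, 0, "pos"), case(0, 1, "neg")]


def general(x: Dict[str, int]) -> bool:
    return x["a"] == 1  # IF a = 1 THEN pos


def specific(x: Dict[str, int]) -> bool:
    return x["a"] == 1 and x["b"] == 1  # IF a = 1 AND b = 1 THEN pos


general_gaps = gaps(general, "pos", train, test)
specific_gaps = gaps(specific, "pos", train, test)
print("general:", general_gaps)
print("specific:", specific_gaps)
```

On these contrived cases the more general rule ends up closer to the default rule's accuracy and the more specific rule ends up closer to its own training accuracy, which is the pattern the two hypotheses describe. A real comparison would repeat the measurement over many rule pairs and data sets.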

Publication date: 2006